t-tests, non-parametric tests, and large studies—a paradox of statistical practice?

نویسنده

  • Morten W Fagerland
چکیده

BACKGROUND During the last 30 years, the median sample size of research studies published in high-impact medical journals has increased manyfold, while the use of non-parametric tests has increased at the expense of t-tests. This paper explores this paradoxical practice and illustrates its consequences. METHODS A simulation study is used to compare the rejection rates of the Wilcoxon-Mann-Whitney (WMW) test and the two-sample t-test for increasing sample size. Samples are drawn from skewed distributions with equal means and medians but with a small difference in spread. A hypothetical case study is used for illustration and motivation. RESULTS The WMW test produces, on average, smaller p-values than the t-test. This discrepancy increases with increasing sample size, skewness, and difference in spread. For heavily skewed data, the proportion of p<0.05 with the WMW test can be greater than 90% if the standard deviations differ by 10% and the number of observations is 1000 in each group. The high rejection rates of the WMW test should be interpreted as the power to detect that the probability that a random sample from one of the distributions is less than a random sample from the other distribution is greater than 50%. CONCLUSIONS Non-parametric tests are most useful for small studies. Using non-parametric tests in large studies may provide answers to the wrong question, thus confusing readers. For studies with a large sample size, t-tests and their corresponding confidence intervals can and should be used even for heavily skewed data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

بررسی تاثیر

  Introduction: Balance and gait disorders are common motor complications after stroke. Studies have revealed that conventional physiotherapy cannot manage these disorders efficiently so more studies addressing the causes of these complications and presenting efficient treatment protocols are crucial.   Methods: Thirty hemiparetic patients (age range 40-60 years old) participated in this experi...

متن کامل

On the non-parametric multivariate control charts in fuzzy environment

Multivariate control chats are generally used in situations where the simultaneous monitoring or control of two or more related quality characteristics is necessary. In most processes in the real world, distribution of the process characteristics are unknown or at least non-normal, so the non-parametric or distribution-free charts are desirable. Most non-parametric statistical process-control t...

متن کامل

Statistical and Practical Significance of Articles at Sports Biomechanics Conferences

Background. The importance of using statistical approaches has increased and became necessary for researchers and specialists in sports biomechanics because they need more objective and accurate methods to increase knowledge. Objectives. Evaluate the reality of using practical significance in the articles published in scientific conferences in the biomechanical sport. Methods. One hundred twe...

متن کامل

Validation of drop plate technique for bacterial enumeration by parametric and nonparametric tests

Drop plate technique has a priority and preference compared with the spread plate procedure, because of less time, quantity of media, effort requirement, little incubator space, and less labor intensive. The objective of this research was to compare the accuracy and fidelity of drop plate method vs. spread plate method by parametric and nonparametric statistical tests. For bacterial enumeration...

متن کامل

The Impact of Task-based Instruction on the Enhancement of Iranian ‎Intermediate EFL Learners’ Speaking Skill and Emotional Intelligence

This study tried to investigate the impact of task-based instruction (TBI) on the enhancement of Iranian EFL learners’ speaking skill. The study also tried to scrutinize the impact of TBI on learners’ emotional intelligence. To meet these ends, 60 students were randomly divided into two groups, the experimental group and the control group. At the very first session of the term, two speaking exa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 12  شماره 

صفحات  -

تاریخ انتشار 2012